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Abstract. In this article we show that every group with a finite 
presentation satisfying one or both of the small cancellation con- 
ditions C"(l/6) and C"(l/4) - T(4) has the property that the set 
of all geodesies (over the same generating set) is a star-free regular 
language. Star-free regularity of the geodesic set is shown to be 
dependent on the generating set chosen, even for free groups. We 
also show that the class of groups whose geodesic sets are star-free 
with respect to some generating set is closed under taking graph 
(and hence free and direct) products, and includes all virtually 
abelian groups. 



1. Introduction 

There are many classes of finitely presented groups that have been 
studied via sets of geodesies that are regular languages (that is, sets de- 
fined by finite state automata). Various examples are known of groups 
for which the set of all geodesies is a regular set. Word hyperbolic 
groups with any finite generating set are very natural examples; for 
these the sets of geodesies satisfy additional ( "fellow traveller" ) prop- 
erties which make the groups automatic [2j Theorem 3.4.5]. Other 
examples include finitely generated abelian groups with any finite gen- 
erating set [TDJ Propositions 4.1 and 4.4]; and with appropriate gen- 
erating sets, virtually abelian groups, geometrically finite hyperbolic 
groups [TQl Theorem 4.3], Coxeter groups (using the standard gen- 
erators) j6], Artin groups of finite type and, more generally, Garside 
groups pQ (and hence torus knot groups). 

To date very little connection has been made between the properties 
of the regular language that can be associated with a group in this 
way and the geometric or algebraic properties of the group itself. In 
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this paper we consider groups whose sets of geodesies satisfy the more 
restrictive language theoretic property of star-free regularity. 

The set of regular languages over a finite alphabet A is by definition 
the closure of the set of finite subsets of A under the operations of union, 
concatenation, and Kleene closure [5j Section 3.1]. (The Kleene closure 
X* of a set X is defined to be the union U^ X* of concatenations of 
copies of X with itself.) By using the fact that the regular languages 
are precisely those that can be accepted by a finite state automaton, 
it can be proved that this set is closed under many more operations 
including complementation and intersection [5J Section 4.2]. The set 
of star-free languages over A is defined to be the closure of the finite 
subsets of A under concatenation and the three Boolean operations 
of union, intersection, and complementation, but not under Kleene 
closure [HI Chapter 4 Definition 2.1]. 

The star-free languages form a natural low complexity subset of the 
regular languages. An indicator of the fundamental role that they play 
in formal language theory is the surprisingly large variety of conditions 
on a regular language that turn out to be equivalent to that language 
being star-free. The book by McNaughton and Papert [9] is devoted 
entirely to this topic. One such condition that we shall make use of 
in this paper is the result of Schiitzenberger that a regular language is 
star-free if and only if its syntactic monoid is aperiodic. Other exam- 
ples studied in [9] are the class LF of languages represented by nerve 
nets that are buzzer-free and almost loop-free, and the class FOL of 
languages defined by a sentence in first order logic. A regular language 
not containing the empty string is star-free if and only if it lies in LF 
or, equivalently, in FOL. There are also relationships between star-free 
languages and various types of Boolean circuits; these are discussed in 
detail in the book by Straubing [13] . 

Note that A* = C is star-free. The set of star-free languages properly 
contains the set of all locally testable languages. A subset of A + := 
A* \ {e} (where e is the empty word) is locally testable if it is defined 
by a regular expression which combines terms of the form A*u, vA* 
and A*wA*, for non-empty finite strings u, v, w, using the three Boolean 
operations [TTJ Chapter 5 Theorem 2.1]. A natural example of a locally 
testable language is provided by the set of all non-trivial geodesies in 
a free group in its natural presentation. Further examples of star-free 
languages are provided by the piecewise testable languages; a subset of 
A* is piecewise testable if it is defined by a regular expression combining 
terms of the form A*a\A*a2 ■ ■ ■ A*a^A*, where k > and each ai G A, 
using the three Boolean operations [TTJ Chapter 4 Proposition 1.1]. A 
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natural example of a piecewise testable language is provided by the set 
of geodesies in a free abelian group over a natural generating set. 

Margolis and Rhodes [8] conjectured that the set of geodesies in any 
word hyperbolic group (with respect to any generating set) is star-free. 
This conjecture was motivated by an interpretation of van Kampen 
diagrams in terms of Boolean circuits, and utilization of relationships 
between properties of circuits and star-free languages mentioned above. 

We show that the conjecture as stated is false in Section HJ where 
we describe a 6-generator presentation of the free group of rank 4 for 
which the set of geodesies is not star-free. This demonstrates that the 
property of having star-free geodesies must be dependent on the choice 
of generating set. 

By contrast, our main theorem, in Section [31 states that groups de- 
fined by a presentation satisfying either one of the small cancellation 
conditions C"(l/6) or C"(l/4) — T(4) (which imply but are not a con- 
sequence of hyperbolicity) have star-free sets of geodesies with respect 
to the generating set of that presentation. Our proof relies on the thin- 
ness of van Kampen diagrams for these presentations, which was used 
in [H] to show that these groups are word hyperbolic. The question of 
whether or not any word hyperbolic group must have some generating 
set with respect to which the geodesies are star-free remains open. 

In Section [5] we consider closure properties of the set of groups that 
have star-free sets of geodesies for some generating set. We show that 
this set is closed under taking direct, free, and graph products. Our 
proof follows the strategy of the proof in [7J showing that the set of 
groups with regular languages of geodesies (for some generating set) is 
closed under graph products. 

It is proved in [TOJ Propositions 4.1 and 4.4] that every finitely gen- 
erated abelian group has a regular set of geodesies with respect to any 
finite generating set, and that every finitely generated virtually abelian 
group G has a regular set of geodesies with respect to some finite gen- 
erating set. (An example of J.W. Cannon, described after Theorem 
4.3 of [10], shows that the set of geodesies need not be regular for ev- 
ery finite generating set of a virtually abelian group.) In Section O 
we strengthen these results to show that all finitely generated abelian 
groups have piecewise testable (and hence star-free) sets of geodesies 
with respect to any finite generating set, and that all finitely generated 
virtually abelian groups also have piecewise testable sets of geodesies 
with respect to certain finite generating sets. 
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2. Some technicalities and basic results 

Let A be a finite alphabet and let M be a deterministic finite state 
automaton over A. We assume that all such automata in this paper 
are complete; that is, for every state and letter in A, there is a corre- 
sponding transition. For a state a of M and word u G A*, define o u to 
be the state of M reached by reading u from state a. The transition 
monoid associated with M is defined to be the monoid of functions 
between the states of M induced by the transitions of M. Since the 
automaton is finite, this transition monoid is a finite monoid. 

For a regular language L of A*, there is a minimal finite state automa- 
ton Ml accepting the language L, which is unique up to the naming 
of the states [5]. In this minimal automaton, no two states have the 
same "future" ; that is, the two sets of words labelling transitions from 
two distinct states to accept states must be distinct. The syntactic 
congruence of L is the congruence which relates two words u and 
v provided, for all words x and y, xuy G L if and only if xvy G L. The 
syntactic monoid associated with L is the quotient monoid A*/ 
The images [u] and [v] of u and v in the syntactic monoid A*/ ~ L are 
equal if and only if o u = o v for all states a of Ml- Thus the syntactic 
monoid of L is also the transition monoid of Ml- 

A monoid is said to be aperiodic if it satisfies a rule x N = x N+l for 
some N G N. By a result of Schiitzenberger (TTJ Chapter 4 Theorem 
2.1], a regular language L over A* is star-free if and only if its syntactic 
monoid is aperiodic. The following proposition yields an alternative in- 
terpretation of star-free, which allows an easy algorithmic check (which 
we used to check examples). 

Proposition 2.1. Let L be a regular language over A. The language 
L is not star-free if and only if, for any N G N, there exist n > N 
and words u,v,w G A* with one of the words uv n w and uv n+1 w in L 
and the other not in L. Moreover, if L is not star-free, then there exist 
fixed words u, v, w G A* such that, for each N G N, there exists n > N 
with uv n w G L but uv n+1 w ^ L. 

Proof. If L is star-free, Schutzenberger's Theorem says that the syn- 
tactic monoid is aperiodic, so there is an iV G N with x N = x N+1 for 
all x in the monoid A*/ For any u,v,w G A*, the image [v] of v in 
A*/ ~l satisfies [v] = [v] N+1 , and so a v = a v for all states o of 
the minimal automaton Ml accepting L, including the state a = erg, 
where <To is the start state. Thus for all n > N, the word uv n w will 
be accepted by Ml if and only if uv n+1 w is accepted. The proof of the 
converse is similar. 
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Now suppose that L is not star-free, and let m be the number of 
states of the automaton Ml- The first part of the proposition shows 
that there are words u,v,w G A* and an integer i > m with one of the 
words uv l w or uv l+1 w in L and the other in A* \ L. Since i > m there 
must be two integers ji, J2 with < j \ < ji < i such that cr™ Jl = a™ 32 ■ 
Let k = J2 — ji- Then for all natural numbers I, a™' w = <Jq v * + w and 
a uv*+ w _ a uv t+ + w _ g ence f or infinitely many natural numbers n > m, 
we have uv n w G L and uv n+1 w (jL L. □ 

The following lemma will be useful when considering sets of geodesies 
in subgroups. 

Lemma 2.2. Let B be a subset of A. Then B* is a star-free language 
over A. 

Proof. The set B* can be written as 

b* = (u aeA \ B rad>y, 

where S c denotes the complement A* \ S of a subset S of A*. This is 
a star-free expression for B*. □ 

In Section [5] we will utilize another equivalent characterization of 
star-free languages, shown in the following proposition. 

Definition 2.3. We define a circuit in an automaton through a state 
o to be powered if the circuit is labelled by a word v k for some v G A + 
with k > 1 such that a v ^ a v . 

Proposition 2.4. A regular language L is star-free if and only if the 
minimal automaton to recognize it has no powered circuits. 

Proof. Suppose that v k labels a powered circuit in the minimal au- 
tomaton Ml for L beginning at a state a, with k > 1 the least natural 
number such that a v = a. Let u label a route from the start state <To 
to a. Since Ml is minimal, the states a u and a uv must have different 
futures, so there is a word w such that one of uw and uvw is in L 
and the other is not. Hence for all natural numbers a, we also have 
that one of uv ak w and uv ak+l w is in L and the other is not. So, by 
Proposition l2.lt L cannot be star-free. 

Conversely, suppose that L is not star-free. Let m be the num- 
ber of states of Ml- Again using Proposition 12. 1[ there exist words 
u, v, w in A* such that uv n w G L but uv n+1 w G^ L for some n > m. 
Among the targets from the start state under the transitions labelled 
u,uv,uv 2 , . . . ,uv m there must be at least one coincidence, and hence 
there exist numbers < a < b < m such that the targets satisfy 
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a o" a = a o" ■ Since uv n w and uv n+1 w do not have the same target, nei- 
ther do uv 11 and uv n+1 , and hence we cannot have b — a + l. Therefore 
v b ~ a labels a powered circuit at <7™ a . □ 

3. Small cancellation result 

For a finite group presentation ( X \ R } in which the elements of R 
are cyclically reduced, we define the symmetrization R* of R to be the 
set of all cyclic conjugates of all words in R U R~ l . 

We define a piece to be a word over X which is a prefix of two (or 
more) distinct words in i?*. Geometrically, a piece is a word labelling 
a common face of two 2-cells, or regions, of a Dehn (or van Kampen) 
diagram for this presentation. 

Where A > 0, we say that the presentation satisfies C"(A) if every 
piece has length less than A times the length of any relator containing 
it; in other words, a word labelling a common face between two diagram 
regions has length less than A times the length of either of the words 
labelling the boundaries of the regions. 

Where q is a positive integer greater than 3, we say that the presen- 
tation satisfies T(q) if, for any h with 3 < h < q and r\, . . . , Th £ R* 
with Ti 7^ r^_ x (1 < i < h) and r\ ^ r^ 1 , at least one of the words 
r 1 r 2 , r 2 r 3 , . . ., r h ri is freely reduced. Geometrically, this implies that 
each interior vertex of a Dehn diagram with degree greater than 2 must 
actually have degree at least q. We refer the reader to [H] for more 
details on small cancellation conditions and diagrams. 

In this section we prove the following theorem, utilizing results from [13] 
which show that sufficiently restrictive small cancellation conditions 
force Dehn diagrams to be very thin. 

Theorem 3.1. Let G = (X | R) be a finite group presentation such 
that the symmetrization R* of R satisfies one of the small cancellation 
conditions C '(1 / '6) (hypothesis A), or C (1/4) andT(4) (hypothesis B). 
Then the language of all words over X that are geodesic in G is regular 
and star-free. 

Note that it is already well known that these small cancellation con- 
ditions imply word hyperbolicity (see [2]), and hence that the geodesic 
words form a regular set. 

The remainder of this section is devoted to the proof of Theorem 13. II 
Let L be the language of all geodesic words for a presentation of a 
group G satisfying the hypotheses of this theorem, and suppose to the 
contrary that L is not star-free. 

For any word y over X U X -1 , let \y\ denote the length of the word 
y, and let \y\a denote the length of the element of G represented by y. 
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By Proposition 12.11 there exist words u,v,w over X such that there 
exist arbitrarily large n with uv n ~ 1 w geodesic and uv n w not geodesic. 
Choose u, v, and w with this property such that \u\ + \w\ is minimal. 
Note that u, v, and w are each nonempty. Then there exist arbitrarily 
large n with uv n ~ 1 w geodesic and uv n w minimally non-geodesic (that 
is, all of its proper subwords are geodesic), since otherwise u',v,w or 
u, v, w' would have the property in question, where u' and w' are respec- 
tively the maximal proper suffix of u and the maximal proper prefix of 
w. 

Choose some n for which uv n w is minimally non-geodesic and uv n ~ 1 w 
is geodesic. 

Let w = w'x with x G X U X , and let t be the geodesic word 
uv n w' . Then either 



\uv n w\c = \t\ — 1, and there is a geodesic word t' ending in x^ 1 
with \t'\ = \t\ and t' —a t] or 

\uv u w\g — \t\ and there is a geodesic word t' with \t'\ = \t\ and 

t' = G tx. 



In the first case, we let T be the geodesic digon in the Cayley graph 
r of G with edges labelled t and t', and in the second case we let T 
be the geodesic triangle in T with edges labelled t, x and if . In both 
cases, the edges labelled t and t' start at the base point of T. 

Notice that none of the internal vertices of the paths labelled t and t' 
in T can be equal to each other, because such an equality would imply 
that uv n w has a proper suffix that is not geodesic, contrary to the 
choice of n. Hence in any Dehn diagram with boundary labels given by 
the words labelling the edges of T, the vertices of the boundary paths 
labelled t and t' also cannot be equal except at the endpoints. 

It follows from the proof of Proposition 39 (ii) of [H] that there is a 
reduced Dehn diagram A with boundary labels given by T, which has 
the form of one of the two diagrams (corresponding to cases (i) and (ii) 
above) in Figure 10 of that proof, reproduced here in Figure 1. 
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Case (i) 



Case (ii) 



f 

Figure 1. 

A key feature is that all internal vertices in these Dehn diagrams 
have degree 2, and all external vertices have degree 2 or 3. Notice 
that the short vertical paths joining vertices of t to vertices of t' in 
these diagrams are all pieces of the relators corresponding to the two 
adjoining regions and hence have length less than 1/6 or 1/4 of those 
relators under hypotheses A and B respectively. 

The boundary label of each region (except for those regions contain- 
ing the endpoints of t and t') has the form r = sps'p' G i?*, where s and 
s' are nonempty subwords oft and (£' respectively, and p and p' are 
nonempty pieces. We consider s to label the 'top' of r, s /_1 the bottom 
and p and p'~ x the right and left boundaries of r, respectively. We shall 
use this top, bottom, right, and left convention throughout this section 
when referring to any region or union of consecutive regions of A. 

We shall establish the required contradiction to our assumption that 
L is not star-free by identifying a union ^ of consecutive regions of 
A such that the top of ^ is labelled by a cyclic conjugate of v, the 
labels of the left and right boundaries are identical, and the bottom 
label is no shorter than v. In that case the diagram formed by deleting 
tyj from A and identifying its left and right boundaries demonstrates 
that uv n ~ 1 w is not geodesic, contrary to our choice of u, v, and w. 

To construct the top boundary of our region tyj, we start by finding 
a subword s of v n with particular properties. We define a subword s of 
v n to have the property (f) if 

(i) : some occurrence of s in v n labels the top of a region in A 
with boundary r = sps'p' and 

(ii) : \s\ > |r|/3 if hypothesis A holds, \s\ > |r|/4 if hypothesis B 
holds. 

Lemma 3.2. Provided that n is large enough, v n has a subword s 
satisfying (\). 
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Proof. Let r = sps'p' label any region in A. If \s\ < \r\/3 (resp. 
\s\ < |r|/4), then since \p\, \p'\ < |r|/6 (resp. < |r|/4), we have 

\s'\ > \s\. But, since \t\ = \t'\, then provided that n is large enough, 
this cannot be true for all regions of A whose top label is a subword of 
v n . □ 

From now on let n be large enough for the lemma above to be applied. 
In addition let n be larger than 6p, where p is the length of the longest 
relator in the presentation. 

For the remainder of this section, let s be a specific choice of a word 
which is as long as possible subject to satisfying (f). We shall call an 
occurrence of s in v n strictly internal if it does not include any of the 
first p or last p letters of v n . 

Lemma 3.3. Every strictly internal occurrence of s in v n labels the 
top of a region in A . 

Proof. Suppose not, and consider a strictly internal occurrence of s 
which is not the top of a relator. 

First note that s cannot be a proper subword of the top y of a region 
of A. For if it were, since s is strictly internal, the word y would be a 
subword of v n satisfying (f ) and have length longer than s. 

Hence there are adjacent regions of A with labels r\ = sipis^p^ and 
r 2 = S2P2S2P2 where p' 2 = p^ 1 and the vertex at the end of the path 
labelled si and at the beginning of the path labelled s 2 is an internal 
vertex of the subpath of t labelled s. 

Either (1) s\ is a proper subword of s or (2) S2 is a proper subword 
of s, or (3) s is a subword of S1S2 (see Figure 2). 



s s 





Si 

p'l Pi 


S2 

P2 P2 




Si 

P'l Pi 


s 2 

P2 P2 




S'2 

(1) 


S'l 


S2 

(3) " 



Figure 2. 



Note that it is impossible for the T(4) property to hold in any of 
these situations. A violation is provided by the two relators shown, 
together with a relator containing s. So we may assume that C"(l/6) 
holds. 

In case (1), Si is a piece, and so are p[ and pi (since the Pi and p\ are 
the labels of the vertical paths in A), and so each must have length less 



10 



S. HERMILLER, DEREK F. HOLT, AND SARAH REES 



than |r|/6. But then the path p[sipi is shorter than s[, contradicting 
the fact that t' is a geodesic. Case (2) is dealt with similarly. 

In case (3), sHsi and sDs 2 are pieces, and so must each have length 
less than \r\/6, contradicting the condition that \s\ > |r|/3. □ 

Now write t = uv p vv p w', where v = v n ~ 2p . The first occurrence of s 
in v must start before the end of the first v ; that is, v has a prefix of 
the form qs where q is a proper prefix of v. Since n > Qp, the suffix 
y n-2p-3 Q f ^ Yi^s length at least 4p — 3 > p, and so is longer than s. 
Hence this condition on n ensures that the words vqs and v 2 qs are also 
prefixes of v. Then all three of these occurrences of s are also strictly 
internal subwords in v n . By Lemma 13.3} these three occurrences of s 
are each the top label of a region of A, so these subwords must be 
disjoint. Hence t must have a prefix of the form uv p qsq'sq's, where the 
three subwords labelled s are the three discussed above, and sq' is a 
cyclic conjugate of v. 

Let the three regions of A whose tops are labelled by these s subwords 
be called $o> ^2 and $4. If q' is nonempty, let $1 and $3 be the regions 
attached immediately to the left of <3> 2 and $ 4 , respectively (see Figure 
3). Since the s subwords are strictly internal, these regions cannot 
contain the endpoints of t or t'. Then for i = 0, . . . , 4 the region <3>j has 
a boundary label of the form r« := SiPis'^. Note that so = s 2 = S4 = s. 



S2 



q' 



si 

P'l $1 Pi 



V% $2 P2 





4 



Pa- 



Figure 3. 

Lemma 3.4. We have r 2 = r 4 and p' 2 - 

Proof. Since s satisfies (f) and so is too long to be a piece, it follows 
immediately that r 2 = r±. 

We shall show that p' 2 = p' A in the case when q' is not the empty 
word; the proof in the case when q' is empty is similar. 

First we shall prove that si = S3. 

Under hypothesis A, since p\ and p[ are pieces and i! is geodesic, 
we have |pi|, \p[\ < |ri|/6 and \s[\ < [7*1 1/2, so therefore \si\ > |r x | /6. 
Thus si is too long to be a piece, so the relator r\ is is the only element 
of with prefix S\. Let y be the longest suffix of q' which is a subword 
of an element of i?*. Then S\ is a suffix of y, i.e. y = zsi, and again 
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y is too long to be a piece, so the only relators in R* containing y are 
cyclic conjugates of r x . Hence the (possibly empty) word z is a suffix 
of r\. If z were nonempty, then the last letter a of z would be a suffix 
of Pi, and the relator f = sps'p' labelling the boundary of the region 
$ immediately to the left of $1 would have top label s ending with a. 
But the first letter of p = p'f 1 would then be a -1 , contradicting the 
fact that the relator r 6 fl, must be reduced. Thus si must be the 
longest suffix of q' which is a subword of an element of in this case. 

Under hypothesis B, again let y be the longest suffix of q' which is a 
subword of an element of i?*, and write y = zs\. If z were nonempty, 
then the regions $, $1, and a third region with boundary label contain- 
ing y glued along this word to the other two regions, would violate the 
T(4) condition. Thus again s\ must be the longest suffix of q' which is 
a subword of an element of R*. 

The same argument under both hypotheses shows that s 3 is also the 
longest suffix of q' which is a subword of an element of R*, so s± — s 3 . 

Now suppose that p' 2 7^ p' A . Then one is a subword of the other, since 
both are suffixes of r 2 = r^. Suppose (without loss of generality) that 
p' A = r~ x p' 2 \ then p 3 = p\T. If r\ = r 3 then r is a prefix of s'^sipi and 
r" 1 is a suffix of p' 2 S2P2s' 2l and so t' is not freely reduced. If r\ 7^ r 3 , 
then s\p\ = s 3 pi is a piece, and under either C"(l/6) or C"(l/4) must 
have length less than |ri]/4. Then p[sipi has length less than |ri|/2, 
and s[ cannot be geodesic. □ 

Now let be the union of the regions $2 and all regions of A to 
the right of $ 2 up to but not including $ 4 . Then the boundary label 
of ^1 is s^p' A x a' x p' 2 where sq' is a cyclic conjugate of v, p' A — p' 2 , and 
tj[ is a subword of (t')^ 1 (see Figure 4 with q' ^ 1). 



S2 



>P 2 



$ 9 



$3 P3' 



Figure 4. 

Provided that n is large enough, we can define similar unions of 
regions ^2, ^3, • • •, ^i, ... of A, where is immediately to the left of 
VPi+i, and has boundary label sq'p' A 
of (tT 1 . 



1 <y' i p' 2 , where each a\ is a subword 
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We have not attempted to show that the words a[ are equal for all 
i, but since \t\ = \t'\ we cannot have |o~-| < \v\ for all i, and so there 
exists a j with \a'j\ > \v\. Now, if we remove ^ from A, the effect is 
to replace t by uv 1l ~ 1 w' and t' by a word of length at most \t\ — \v\, so 
we obtain a diagram that shows that uv n ~ 1 w is not geodesic, contrary 
to assumption. 

4. Dependence on generating set 

In this section, we give two examples of groups for which the set of 
geodesies is star-free with respect to one generating set, but not with 
respect to another. 

Free groups have star-free geodesies with respect to their free gener- 
ators, but the following example shows that this is not necessarily the 
case with an arbitrary generating set. Let 

G := (a,b,c,d,r, s \ ba 2 d = res, bd = s). 

The two relations can be written as r = 6a 2 6~ 1 c~ 1 , s = bd, and so 
they can be used to eliminate r and s from any word representing an 
element of G. Hence G is free on a, b, c and d. 

Since ba 2k d —a {ba 2 b~ l ) k bd —a (rc) k s for all k > 0, we have that 
ba 2k d is not a geodesic word. We shall now show that ba 2k+1 d is a 
geodesic word for all k > 0, which implies, by Proposition 12. 1[ that 
the set of geodesies for the group defined by this presentation is not 
star-free. 

Let w = ba 2k+1 d for some k > 0. Suppose to the contrary that w is 
not geodesic. Then there is a word x in a, b, c, d, r, s and their inverses 
satisfying l(x) < l(w) which freely reduces to w after we make the 
above substitutions to eliminate r and s. Let p a denote the number 
of occurrences of a in x, let n a denote the number of occurrences of 
a -1 in x, and similarly for the other five generators of G. Since the 
exponent sum of the a's in w is 2k + 1, and the only letters of x that 
contribute powers of a after the substitution are powers of a and r, we 
have p a — n a + 2p r — 2n r = 2k + 1. Computing the exponent sums of the 
6's, c's, and cTs in w in the same way, we obtain p b — n b + p s — n s = 1, 
Pc — n c — Pr + n r = 0, and pd — n d + p s — n s = 1, respectively. Then 

2k + 3 = l(w) > l(x) 

= Pa + n a + p b + n b + p c + n c + p d + n d + p r + n r + p s + n 
= 2n a + 2n b + 2n c + p d + n d + 2n r + 2n s + 2k + 2. 
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Therefore n a = ni = n c = rid = n r = n s = pd = 0, so the word x 
contains no occurrences of inverses of the generators of G, and also no 
occurrences of the generator d. As a consequence, then p a +2p r = 2k+l, 
p s = 1, pb = 0, and p r = p c . Hence the letter b also does not occur in 
x, the letter s occurs once, and since 2k + 1 is odd, we have p a > so 
x contains at least one a. Since the d is at the right hand end of w, 
the s must be at the right hand end of x. Hence x has the form yazs, 
where y and z are words over a, c, r. After making the substitutions 
for r and s in x, all of the powers of a in the word are positive, so the 
a in the expression yazs for x will not be cancelled after further free 
reduction, but the exponent sum of the b's to the left of this a is zero. 
Thus the resulting word cannot freely reduce to w, giving the required 
contradiction. 

Our second example is the three-strand braid group B3, which has a 
presentation ( a, b \ bob = aba). The geodesies for this group on gener- 
ators {a ±1 ,6 ±1 } are described in [12]. A reduced word is geodesic if it 
does not contain both one of {ab, ba} and also one of {a _1 6 _1 , 6 _1 a -1 } 
as subwords, and it does not contain both aba and also one of {a -1 , b^ 1 } 
as subwords, and it does not contain both a _1 6 _1 a _1 and also one of 
{a, b} as subwords. Hence the language of geodesies can be expressed 
as 



[(0W U c 6a0 c ) n (0 c a- 1 r 1 c u c r 1 cr 1 c )] c 

n [(0 c a6a0 c ) n (0 c a- 1 c U ^b^)} 
n [(0 c a- 1 r 1 a- 1 c ) n (0 c a0 c U C 60 C )] C , 



which is a star-free regular language. 

In p], Charney and Meier prove that Garside groups have regular 
geodesies with respect to the generating set consisting of the divisors 
of the Garside element and their inverses. The class of Garside groups 
includes and generalizes the class of Artin groups of finite type, which 
itself includes the braid groups. The set of divisors of the Garside 
element in the three-strand braid group B 3 is {a,b,ab,ba,aba}, and 
an automaton accepting the geodesies in the positive monoid of this 
example is calculated explicitly in Example 3.5 of pQ. We see from 
this that (ba)(aba) n (a) is a geodesic for n even, but not for n odd. So, 
by Proposition \2.1\ the language of geodesies for B 3 with this second 
generating set is not star-free. 
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5. Direct products, free products and graph products 

We start by proving the straightforward result that the class of 
groups with star-free sets of geodesies is closed under taking direct 
products. 

Lemma 5.1. If the languages Lx,L 2 of all geodesies of (Gx,Xx) and 
(G 2 , X 2 ) are star-free then so is the language L of all geodesies of {Gx X 
G 2 ,XxUX 2 ). 

Proof. The language L is the set of words over Xx U X 2 which project 
onto words in each of Lx and L 2 if we map in turn the elements of 
X 2 ,Xx to the empty strings. We show that L can be described as the 
intersection of two star-free languages, L[ and L' 2 . 

The language L[ is defined by wrapping arbitrary strings in X 2 
around the elements of X\ for each string in L x ; that is, 

L[ := {wqXxWx ■ ■ ■ x n w n \ Wi G X 2 and X\ ■ • ■ x n G Lx}. 

Thus a regular expression for L[ is found by replacing each element 
x of Xx in a star-free (regular) expression for Lx by X 2 xX 2 . Note 
that Lemma 12.21 shows that X^ is a star-free language over Xx U X 2 . 
Since star-free languages are closed under concatenation, this regular 
expression for L[ shows that L[ is star-free. The language L' 2 is defined 
similarly. □ 

It is straightforward to prove that an analogue of the above result 
also holds for free products. In fact in the next theorem we prove a 
more general result, which includes both direct and free products as 
special cases. 

Definition 5.2. Let T be a finite undirected graph with n vertices la- 
belled by finitely generated groups Gi. Then the graph product U r G 
of the groups Gi with respect to T is defined to be the group generated 
by Gx, ■ ■ ■ , G n modulo relations implying that elements of Gi and Gj 
commute if there is an edge in F connecting the vertices labelled by Gi 
and Gj . 

Thus if T is either a graph with no edges or a complete graph, then 
the graph product is the free or the direct product of the groups Gj, 
respectively. 

The word problem for graph products is studied in detail in [3] 
and [1]. If we use a generating set for LTrG that consists of the union of 
generating sets of the vertex groups Gi, then it turns out that a word w 
in the generators is non-geodesic if and only if pairs of adjacent genera- 
tors in w that lie in commuting pairs of vertex groups can be swapped 
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around so as to produce a non-geodesic subword lying in one of the 
vertex groups. This property is used in [7J to prove that the geodesies 
form a regular set if and only if the geodesies of the vertex groups all 
form regular sets. We adapt this proof to show that the same is true 
with 'regular' replaced by 'star-free'. 

Theorem 5.3. Let G\, . . . G n be the vertex groups of a graph product 
G := UrG, let Ai, . . . A n be finite inverse- closed sets of generators for 
Gi, and let Li, . . . L n be the languages of all geodesies in G\, . . . G n over 
A±, . . . A n , respectively. Let A := Uf =l Ai and let L be the language of 
all geodesies in G over A. The languages L\,...L n are all star-free if 
and only if the language L is star-free. 

Proof. First, suppose that L is star-free. For each vertex index i, a 
word over A* which is geodesic as an element of G{ is also geodesic as 
an element of G, and the language Li is the intersection of the star-free 
language L with A*. Lemma [2.21 shows that the set A* is star-free, so 
Li is also star-free. 

Conversely, suppose that each language is star-free. Let F { denote 
the minimal finite state automaton over Ai that accepts Lj. Since any 
prefix of a geodesic is also geodesic, the language Lj is prefix-closed, 
and therefore the automaton F{ has a single fail state, and all other 
states are accept states. 

Following the proof in [7J, for each % define a finite state automaton Fi 
over A by adding arrows for the generators in A \ Ai to the automaton 
Fi as follows. For each a G A\ Ai which commutes with Gj, a loop 
labelled a is added at every state of Fi (including the fail state). For 
each b e A\Ai which does not commute with Gi, an arrow labelled b 
is added to join each accept state of Fi to the start state, and a loop 
labelled b is added at the fail state. Completing the construction in [7J, 
an automaton F is built to accept the intersection of the languages of 
the automata Fi, and the authors show that the language accepted by 
F is exactly the language L of geodesies of UrG over A. 

Note that if two states of Fi have the same future, then these states 
have the same future under the restricted alphabet Aj. Thus minimality 
of the automaton Fi implies that Fi is also minimal. 

Since Li is star-free, Proposition 12.41 says that Fi has no powered 
circuits. Then the finite state automaton Fi has no powered circuit 
labelled by a word in Af. Suppose that Fi has a powered circuit over 
A + and let v be a least length word such that a power of v labels a 
powered circuit in Fj. Let a be the beginning state of this circuit, and 
let k > 1 be the least natural number such that a v = a. The states 
in this circuit must be accept states. If v contains a letter a G A\A { 
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which commutes with Gi, then the word v with the letter a removed 
also labels a powered circuit at a, contradicting the choice of v with 
least length. Then we can write v = Vxbv% such that t>i,t>2 € A* and 
the letter b 6 A \ Ai does not commute with Gj. Hence a Vlb is the 
start state a it0 of Fi and the word v 2 V\b labels a circuit at cr ii0 , and so 
v labels a circuit at a, contradicting the condition k > 1. Therefore Fi 
also has no powered circuits. Applying Proposition 12.41 again, then the 
language accepted by Fi is star-free. 

Finally, the language L is the intersection of the star-free languages 
accepted by the F iy so therefore L is also star-free. □ 

6. Virtually abelian groups 

In this section, we prove that every finitely generated abelian group 
has a star- free set of geodesies with respect to any finite generating set, 
whereas every finitely generated virtually abelian group G has some 
finite generating set with respect to which G has star-free geodesies. 

As a special case of these results, note that we can see quickly that the 
geodesic language of Z n for the standard (inverse closed) generating set 
is star-free, either via Lemma Iq~T1 or as follows. The minimal automaton 
for this language has states corresponding to subsets of the generators 
that do not contain inverse pairs, together with a fail state. At a state 
given by a subset S, the transition corresponding to a generator a will 
go either to S itself if S contains a, to S U {a} if S does not contain 
a or a" 1 , and to the fail state if S contains a -1 . Then the transition 
monoid of the minimal automaton for this language of geodesies, i.e. the 
syntactic monoid, is both abelian and generated by idempotents, and 
hence every element is an idempotent and the monoid is aperiodic. 
Schutzenberger's Theorem then says that this language is star-free. 

Our arguments in this section make use of a condition that is more 
restrictive than the star-free property, and indeed more restrictive than 
the piecewise testable property, which we shall call piecewise excluding. 
A language L over A is said to be piecewise excluding if there is a finite 
set of strings W C A* with the property that a word w G A* lies in 
L if and only if w does not contain any of the strings in W as a not 
necessarily consecutive substring. In other words, 

L = (U? =1 {A*a a A*a i2 A* ■ ■ ■ A*a ih A*}) c , 

where W = {ai, . . . , a n } and = ana i2 ■ ■ ■ for 1 < i < n. It fol- 
lows directly from the above expression and the definition of piecewise 
testable languages given in Section [T]that piecewise excluding languages 
are piecewise testable, and hence star-free. 
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We also need the following technical result, in which the set N of 
natural numbers includes 0. 

Lemma 6.1. Define the ordering ^ on W by (mi, . . . , m r ) ^ (n 1; . . . , n r ) 
if and only if mi < for 1 < i < r. Then any subset ofW has only 
finitely many elements that are minimal under 

Proof. See [21 Lemma 4.3.2] or the second paragraph of the proof of [TO] 
Proposition 4.4]. □ 

Proposition 6.2. If G is a finitely generated abelian group, then the 
set of all geodesic words for any finite monoid generating set of G is 
a piecewise excluding (and hence a piecewise testable and a star-free) 
language. 

Proof. Let A = {x±, . . . , x r } be a finite monoid generating set for G, 
and let 

U := { (ni, . . . , n r ) G N r | x™ 1 ^ 2 • • • x™ r is non-geodesic }. 

By Lemma 16.11 the subset V C U, consisting of those elements of U 
that are minimal under ^, is finite. Let W be the set of all permu- 
tations of all words x^x^ 2 • • • x™ r with (ni, . . . ,n r ) e V. Since G is 
abelian, whether or not a word over A is geodesic is not changed by 
permuting the generators in the word. Hence, by definition of minimal- 
ity under ^, any non-geodesic word over A contains a word in W as a 
not necessarily consecutive substring. Conversely, since the words in W 
are themselves non-geodesic, any word over A containing one of them 
as a not necessarily consecutive substring is also non-geodesic. So the 
set of geodesic words over A* is piecewise excluding, as claimed. □ 

Proposition 6.3. Any finitely generated virtually abelian group has 
an inverse- closed generating set with respect to which the set of all 
geodesies is a piecewise testable (and hence a star-free) language. 

Proof. The analogous result for 'regular' rather than 'piecewise testable' 
is proved in Propositions 4.1 and 4.4 of [TO]. We extend that proof. 

Let N be a finite index normal abelian subgroup in G. We choose 
a finite generating set of G of the form Z = X U Y with the following 
properties. 

(1) X C N and Y C G\N. 

(2) Both X and Y are closed under the taking of inverses. 

(3) X is closed under conjugation by the elements of Y. 

(4) Y contains at least one representative of each nontrivial coset 
of iV in G. 
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(5) For any equation w =g xy with w a word of length at most 3 
over Y, y EY U {1} and x G N, we have iGl. 

We must first show that such generating sets exist. To see this, start 
with any finite generating set Z of G and let X := Z D N, Y := Z \ X. 
Adjoin finitely many new generators to ensure that Property H] holds 
and that Y is closed under taking inverses. Since Y is finite, there 
are only finitely many possible words w in Property EJ and we can 
adjoin finitely many new generators in N to ensure that Property [5] 
holds. Now adjoin inverses of elements of X to get Property [2j Since 
iV is abelian and \G : N\ is finite, elements of N have only finitely 
many conjugates in G, and so we can adjoin finitely many conjugates 
of elements of X to Z to get Property [3J, after which X will still be 
closed under inversion. The five properties will then all hold. 

Now let L be the set of all geodesic words over Z. For i > 0, let Zj be 
the set of all words z\ ■ ■ ■ z m G Z* for which precisely i of the symbols Zj 
lie in Y, and let Lj := LflZj. Let Zj be the set of words in Z* containing 
at least i letters of Y. Then Zj := \J yu _ m& {Z*y x Z* • • ■ Z*y { Z*} is a 
piecewise testable language. The set Zj equals the intersection Zj fl 
(Zj + i) c , and so is also piecewise testable. 

First we shall show that L = L U L\ U L 2 . Property [3] implies that 
any word in L is equal in G to a word of the same length involving the 
same elements of Y, but with all of those elements appearing at the 
right hand end of the word. By Property [5j any word over Y of length 
three or more is non-geodesic. So L is contained in (and hence equal 
to) L UiiU L 2 . 

Now Property [T] ensures that an element of Li cannot represent an 
element of N, and the same is true for L 2 , because Property implies 
that a word in Z 2 that represents an element of TV cannot be geodesic. 
So L is equal to the set of all geodesic words in iV over X, and the 
set X generates the subgroup N. Then L is piecewise testable by 
Proposition 16.21 

Next we show that L\ is piecewise testable. For a fixed y G Y, 
applying the same argument as in the proof of Proposition 16.21 to X*y, 
we can show that there is a finite set W y of words over X with the 
property that a word in X*y is non-geodesic if and only if it contains 
one of the words in W y as a not necessarily consecutive substring. 

For each y G F and x G X, denote the generator in X equal in G to 
y~ l xy (which exists by Property [3]) by x y . Define W to be the (finite) 
set of words over Z of the form x\ - ■ ■ x t yx v t+1 • • • x v s , where y G Y and 
Xx ■ • • x t x t +i • • • x s G W y . Then a word in Z\ is non-geodesic if and only 
if it contains a word in If as a not necessarily consecutive substring. 



STAR-FREE GEODESIC LANGUAGES FOR GROUPS 



19 



So Li = P fl Zi, where P is a piecewise excluding language. Since Z\ 
is piecewise testable, this shows that L\ is piecewise testable. 

The proof that L 2 is piecewise testable is similar and is left to the 
reader. So L = L U L\ U L 2 is piecewise testable. □ 
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